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Abstract We present a sample of about 120,000 red clump candidates selected from the 
LAMOST DR2 catalog based on the empirical distribution model in the effective temper¬ 
ature vs. surface gravity plane. Although, in general, red clump stars are considered as 
the standard candle, they do not exactly stay in a narrow range of absolute magnitude, but 
may extend to more than 1 magnitude depending on their initial mass. Consequently, con¬ 
ventional oversimplified distance estimations with assumption of fixed luminosity may 
lead to systematic bias related to the initial mass or the age, which may potentially af¬ 
fect the study of the evolution of the Galaxy with red clump stars. We therefore employ 
an isochrone-based method to estimate the absolute magnitude of red clump stars from 
their observed surface gravities, effective temperatures, and metallicities. We verify that 
the estimation well removes the systematics and provide an initial mass/age independent 
distance estimates with accuracy less than 10%. 
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1 INTRODUCTION 

Red clump (RC) stars are metal-rich stars in the evolution phase of helium-core burning (Cassisi & 
Salaris [T9971 l. They play important roles in the study of the Galactic disk because they are widespread in 
the thin disk, usually considered as the standard candle (Pacyriski & Stanek 119981 1 and luminous (Girardi 
et al. 119981 Alves 120001 Groenewegen 12008l l and prominent population in color magnitude diagram, 
which makes them be easily identihed from multi-band photometry (Lopez-Corredoira et al. 12002b . 

Identihcation of individual held RC stars, however, is not trivial because they do highly overlap with 
the RGB stars. Pacyhski & Stanek (11998b used a Gaussian to model the distribution of magnitude for RC 
stars and a quadratic polynomial for RGB stars. Then the stars located in the Gaussian dominated region 
are very likely to be RC stars. This method was applied by Nataf et al.( 12013b to select the red clump 
stars in the Galactic bulge. Recently, Bovy et al.( 12014b employed a new method to identify the RC 
stars based on their distribution in color-effective temperature-surface gravity-metallicity and stellar 
evolution model. However, the authors only identify the primary RC stars and remove the secondary 
population to simplify the distance estimation. 

Compared to the identihcation, the distance estimation of RC stars are relatively simple with the 
assumption that the absolute magnitude of RC stars is around a hxed value with a small dispersion 
(Pacyhski & Stanek 119981 Girardi et al. ll998l Alves 120001 Groenewegen |2008l Liu et al. 12012b . 
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However, the stellar evolution model demonstrates that the RC stars are not always stay in the 
same luminosity. They are separated into two subclasses: the primary RC stars, which have electronic- 
degenerate cores, and the second RC stars, which contain non-degenerate He-cores (Girardi ri999b . In 
general, the primary RC stars are low-mass and hence older, while the secondary RC stars are massive 
and therefore younger than 1 Gyr. Most of the primary RC stars are generally brighter than the secondary 
RC stars, thus the former have smaller logp than the later (see Stello et al. 120131 1. In the context of the 
evolution of the Galactic disk, we intend to obtain a sample of RC stars with a wide range of the age 
so that we can trace the evolution of the Galaxy from precent day back to ^ 10 Gyr. For instance, 
Salaris & Girardi (120021 1 fitted the distribution of the Hipparcos (Perryman et al. 11997b observed RC 
stars, including both the primary and secondary populations, in color-magnitude diagram with a stellar 
evolution model and derived the distribution of age in the solar neighborhood. Although keeping both 
the primary and secondary RC stars in the sample is important in the study of the disk evolution, the 
distance estimation may turn out to be more complicated since the RC stars would not be the standard 
candle any more in this case. 

As the first paper of a series of works on the evolution of the Galactic disk with the RC stars 
from the LAMOST catalog, we firstly develop a new method of identification for both the primary 
and secondary RC stars and a normal approach of distance estimation to both populations. This paper 
is organized as below. In Section |2] we briefly introduce the LAMOST survey data and describe the 
empirical method of identification of RC stars. In Section[3 we develop the new method of the distance 
estimation. The performance of the estimates is then assessed in the same section. Finally, we further 
discuss the accuracy of our distance estimation in Section|4]and draw a brief conclusion in Section|5] 

2 IDENTIFICATION OF RC STARS 

In the section we identify the red clump stars from the LAMOST data using an empirical method. 

2.1 The LAMOST data 

The LAMOST, also called Guo Shou Jing telescope, is a 4-meter reflected Schmidt telescope with 4000 
fibers on the 20-squared degree focal plane (Cui et al. 120121 Zhao et al. 12012b . The LAMOST survey 
will finally observe more than 5 million low resolution stellar spectra after its 5-year survey (Deng et 
al. 120141 Liu et al. 12014b . According to Yao et al. (12012b . the site of LAMOST prefers to observe the 
winter sky, which covers the Galactic anti-center region. Therefore, there will be lots of disk populations 
located in the region to be observed by the LAMOST. In this work, we adopt the derived T^s directly 
from the LAMOST pipeline (Wu et al. 1201 lal Wu et al. 1201 Ibl Wu et al. 120141 Luo et al. in prep) 
and the estimated logp from Liu et al. (12015b . who estimate logp using support vector regression with 
the training dataset from the Kepler asteroseismic logp. The uncertainty of the logp estimates is only 
about 0.1 dex, which is a factor of 2 better than that from the LAMOST pipeline, for the metal-rich 
([Fe/H]> —0.6) giant stars, including the RC stars. 

We select the stars with logp between 0.9 and 3.5, [Fe/H] between -0.6 and 0.4, and Teff between 
3600 and 6000 K from the LAMOST DR2 catalog, which widely cover the RC region. We also remove 
all spectra with signal-to-noise ratio lower than 10 and finally obtain 279,423 stars. 

2.2 Identification of RC stars 

It is expected that lots of red clump stars in the disk population will be sampled by LAMOST survey. 
Indeed, Fig[T]shows that the RC stars from LAMOST DR2 catalog are prominent in Tes vs. logp plane. 
In this section, we establish an approach of identification of RC stars from the LAMOST data. 

Paczynski & Stanek (11998b used a Gaussian and quadratic polynomial to model the distribution of 
the magnitude for the RC and RGB stars, respectively. We expand this method into two dimensions in 
Teff vs. logp plane. 
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First, we empirically build a 2-D distribution model for the RGB stars in TeS vs. log^ plane with 
various metallicity bins ([Fe/H] = (—0.6, —0.3), (—0.3, 0.0), and (0.0, 0.4)). We mask the region be¬ 
tween logp= 2.0 and 3.0 to avoid the RC stars and fit the distribution of the rest RGB stars with the 
following empirical model, 

iVRGB(Teff, logp) = A^(log5) + 

(logp) 

where iV(logp) and cr^(logp) are three smoothing cubic spline functions with different metallicity bins 
([Fe/Fl] = (—0.6, —0.3), (—0.3, 0.0), and (0.0,0.4)), respectively; and 

( 150.21og5^ - 946.31ogg^ + 24171ogp + 2188 [Fe/H] e (-0.6, -0.3) 
r(log 5 ) = 49.81ogp^ - 374.11og5^ + 14351ogp + 2632 [Fe/H] e (-0.3,0.0) (2) 

i 81.651og5^ - 590.71ogp^ + 192Hogg + 2107 [Fe/H] e (0.0,0.4) 

Fig|2]shows the best fit curves for the terms in Eq[T}ll] From left to right, the range of metallicity is 
(—0.6, —0.3), (—0.3, 0.0), and (0.0, 0.4), respectively. N(\ogg) and cr^(logg) are the best fit smoothing 
spline functions shown in the top panels and bottom panels of Fig [2] respectively. The rest panels of 
Fig|2]show the best fit polynomials for the terms in Eq|2] 

Providing that the RGB stars are smoothly distributed along log^, we can interpolate the stellar 
distribution of the RGB stars between logp= 2.0 and 3.0 with the best fit model shown in Eqs[T]-l2] The 
middle panels of Eig|3]shows the distribution of RGB stars according to our model in Tgs vs. logp plane 
for various [Ee/H] bins in different rows. 

Second, we subtract the smoothing distribution model of RGB stars between log 5 = 2 and 3 and 
the residuals are mostly contributed by the RC stars, as shown in the right panels of Eig |2 in which 
the contours of 68% (red) and 95% (yellow) completeness of the residual distribution are overlapped. 
Compromise has to be made between the completeness for both primary and secondary RC stars and the 
fraction of the contamination from the RGB stars. We find that the 68% completeness contour cannot 
cover the most region of the secondary RC stars, therefore, we select the 95% completeness, as the 
recommended selection criterion of the RC stars. With this empirical distribution of RC stars, other 
users can freely adjust the criterion of selection to identify different sample of the RC stars to meet 
their specific requirements. It is noted that there are a few fractional areas far way from the empirical 
RC star region are also within the 95% completeness level. They may be artificial features because the 
data is too sparse in these regions. We then manually exclude three artificial areas with Tes> 5200K, 
logp< 2.1 dex and logp> 2.9 dex. Moreover, the bottom-right corner of the 95% completeness level 
in the bottom-right panel is apparently contributed by the red bump stars rather than the clump stars. 
Therefore, a fourth cut at log(;< O.OOlOTeff-4.7170 is added to remove the contamination from the 
bump stars. These additional data cuts are shown as white lines in the right panels of Eig [3 Einally, we 
find 118,711 RC candidates with the refined 95% completeness level. 

Although most of the stars located in the refined 95%-level region are RC stars, some contamina¬ 
tions may also included. Assuming that the residual distribution in T^g vs. logp planes of Eig[3are the 
distributions of the true RC stars, we can give the percentage of the true RC stars by dividing the resid¬ 
ual distributions with the full distributions, which contain both RC and RGB stars, although we cannot 
identify the individual RC stars. Eig |4| shows the percentage in T^g vs. log^ planes for metallicity bin 
(—0.6, —0.3), (—0.3,0.0), and (0.0, 0.4) from left to right, respectively. It shows that the fraction of the 
true RC stars are mostly larger than 75%, even for some regions of the secondary RC stars, in the can¬ 
didate catalog. The contour level of the 68% and 95% completeness, the numbers of the RC candidates 
under such completeness levels, and the total fraction of the true RC stars are listed in Tabled] 

Compared with the method provided by Bovy et al. (120141) , our method does not depend on the 
stellar model but only on the specific observations. Moreover, although the number of the secondary 
RC are less than that of the primary RC, we can still discriminate them from the background RGB stars 
with a fraction of 75%^85%, as shown in Eig|4| Therefore, the identification of RC stars in this work is 
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suitable for both primary and secondary RC stars, ensuring that the study of the evolution of the Milky 
Way can be extended from < 1 Gyr to around 10 Gyr based on this sample. 

3 DISTANCE ESTIMATIONS 

Most of the RC stars are located in the Galactic disk and their apparent magnitudes and color indices 
are significantly affected by the interstellar extinction. Therefore, the distance and the reddening have to 
be determined simultaneously. In the next section, we introduce a likelihood method to determine these 
quantities, and then we apply this method with fixed and various absolute magnitude, respectively, in 
the following two sections. 


3.1 A likelihood method to estimate distance and reddening 


The first step to estimate the distance of RC stars is to estimate the reddening from the observed color 
index. The LAMOST spectra do not have a perfect optical multi-band photometry by now. Its input 
catalog is a combination of UCAC4 (Zacharias et al. 120131) . PanSTARRSl (Tonry et al. 120121 . SDSS 
(Ahn et al. 120141 ). and Xuyi Galactic anti-center survey (Yuan et al. 120151 ). Although all of these source 
catalogs contains g, r, and i bands, they are still not well calibrated with each other, yet. Therefore, in 
this stage, we use the 2MASS catalog (Skrutskie et al. 120061) as the input catalog to derive the reddening 
and distance for RC stars. 

The likelihood of E{J — K) for a RC star given the observed J — K and the intrinsic color index 
of RC stars can be written as: 


Pr{E{J—K)\J—K, aj-K, {J—K)rc, ctrcj-k) ^ exp{— 


(^E{J-K)-{{J-K)-{J-K)rc)? 

+ '^RC.J-k) 

(3) 


where (J — K)rc is the intrinsic color index of RC stars, <Jrc,j-k the intrinsic color index dispersion 
of RC stars, and aj-K the measurement error of the observed J — K. To convert the reddening in J — K 
to the extinction in K band, we adopt Indebetouw et al. (120051) that 


Ak = 0.67E{J-K). 


(4) 


Then, the likelihood of the distance module, DM, for a RC star given the observed K magnitude and 
the fixed absolute magnitude M^f can be written as: 


Pr{DM\K, <7 k, Ak, Mk,cfmk) 




2 (< 


K 


'Mk) 


(5) 


where Mr is the fixed absolute magnitude in K band for RC stars, (Jmk the intrinsic dispersion of abso¬ 
lute magnitude in K band for RC stars, and aK is the measurement error of the apparent K magnitude. 


3.2 The fixed absolute magnitude and the intrinsic color index of the RC stars 

Combined with Eqs. [3-15] the likelihood of both £’( J — K) and DM for RC stars can be derived. 
The last ingredient needed to put in is the absolute magnitude, intrinsic color index, and their intrinsic 
dispersions for RC stars. Although some literatures provided the absolute magnitude in K band and 
the intrinsic absolute magnitude in J — AT for RC stars (Alves 120001 Groenewegen 120081 Zasowski 
et al. 120131 etc.), the intrinsic dispersions for both quantities, which are necessary in our likelihood 
method, are not self-consistently provided. Therefore, we estimate the absolute magnitude in K band, 
the intrinsic color index in J — AT, and their intrinsic dispersions with the Hipparcos data. 

Hipparcos catalog provides parallaxes for more than 100,000 bright stars, thousands of which lo¬ 
cated in the region of red clump in HR diagrams. In order to estimate the absolute magnitude of RC 
stars, we need to correct the reddening as the first step. Bailer-Jones (1201 lb estimated the extinction 
parameters for about 47,000 Hipparcos stars, sparsely distributed in the sky. We extend the extinction 
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to all Hipparcos stars using spatial interpolation. For a star of interest, we select all stars with reddening 
parameters derived by Bailer-Jones (1201II) within a 10-degree-radius circle and 20 pc in distance around 
it. Then we assign the median reddening value for all selected stars as the reddening value for the star 
of interest. In order to ensure the accuracy of the absolute magnitude, we select the stars with error 
of parallax smaller than 20% and errors of 2MASS photometry smaller than 0.5 mag. Fig |5] shows the 
J — K vs. M/f diagrams without (top-left panel) and with dereddening (middle-left panel) for about 
900 giant stars with Mk < 0 mag. 

We adopt the empirical model of the distribution in Mjf from Paczyhski & Stanek (119981 1. which 
has the following form: 


F = a + b(MK - c)^ -f dexp(- ), (6) 

where a, b, c, d, e, and / are the free parameters. The quadratic polynomial in Eq (|6]l models the stellar 
distribution of the RGB stars and the Gaussian term models the RC stars. 

Similarly, we use this model for the marginalized distribution of color index: 

F'= a'+ b'iJ -K- c'f + d'exp(- ~ ). (7) 

We fit the models for the absolute magnitude and color index without and with the dereddening, 
respectively. The top-right panel of Fig|5]shows the best fit for the marginalized reddened absolute mag¬ 
nitude with Eq (|6]l. The middle-right panel shows the best fit model for the marginalized dereddened 
absolute magnitude. And the bottom panel shows the best fit for the reddened (dashed line) and dered¬ 
dened (solid line) color index J — K, respectively. Table |2] lists the best fit absolute magnitude and 
intrinsic color index and their dispersions. It shows that the dereddend absolute magnitude and intrinsic 
color index are brighter and bluer than the reddened values by about 2%, respectively. In this work we 
adopt the dereddened Mjf and J — K as the standard value in the estimation of the distances for RC 
stars. 

3.3 The Mk based on the synthetic isochrones 

The assumption that the RC stars have a fixed magnitude can only work for the primary RC stars, which 
are mostly composed of the relatively older RC stars compared with the secondary ones. When we want 
to trace the evolution of the Galactic disk with RC stars, we cannot only use the primary RC stars and 
ignore the secondary RC stars. Therefore, we need to improve the approach to estimate the distance of 
RC stars so that the secondary RC stars are also taken into account. 

We turn to use a special isochrone fitting process to estimate the absolute magnitude for all kinds 
of RC stars. After quickly reviewing the isochrones, we realize that the absolute magnitude of RC stars 
is a function of logp, [Ee/H], and Teff. Panel (a) of Pig |6] shows the synthetic RC stars from PARSEC 
library (Bressan et al. 12012t in log^ vs. Mk plane. It shows that Mk is tightly dependent of logp. 
Purther separations of the data into different initial stellar masses are shown in the right panels. We find 
that the RC stars with stellar masses of 0.8 ~ 1.1 Mq (panel (b) in Pig|6]l mostly concentrate within 
Mk = —1.4 ~ —1.6 mag, while the M/f for the RC stars with initial mass of 1.1 ^ 1.4 Mq moves up 
to —1.6 ~ —1.8 mag (panel (c)). Then the Mk for the RC stars with initial mass at 1.4 ^ 1.7 Mq move 
back to the range of —1.5 ^ —1.7 mag (panel (d)). Por the RC stars with 1.7 ^ 2.0 Mq , intensively 
extends from ~ —1 to more than —2 mag (panel (e)). To further investigate how the Mk varies, we 
separate the synthetic stars into two groups at [Pe/H]= —0.3. Por the RC stars with [Pe/H]> — O.Sdex, 
the relation between Mk and logp can be empirically modeled with a quadratic polynomial (see the 
related panels in Pig|2]i: 

Mk = -Pilog^^ -f F2logg -f Pa (8) 

The best fit coeffecients (j = 1, 2, 3) are listed in Table|3] Por the stars with [Pe/H]< —0.3 dex, when 
logp< 2.45, the Mk is no longer a function of log^ (see the related panels in Pig IT)- Pig[8]shows that 
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for these stars, the absolute magnitude is roughly a linear function of the effective temperature. Then we 
have the following more complicated relation; 



PiTeS + P 2 logs < 2.45 

Pilogp^ + P 2 \ogg + P 3 \ogg > 2.45 


(9) 


Table|4]shows the best fit coefficients of Pi {i = 1,2, 3). It is noted that for both groups of metallicity, the 
synthetic data (red dots) showing in Figs|7]and|8]are not exactly located on a narrow line, but spread out 
with various dispersions. We then measure the dispersions of the residuals of the for the synthetic 
RC stars with respect to the best fit models at different bins of metallicity and show them in the column 
of auK in Tables[3and|4] 

Then, for each observed RC star, we firstly derive the M/f and cmk from its TeS, logp, and [Fe/H], 
and bring them back to Eq |5] to derive the likelihood of the distance module. Because the intrinsic 
color index of the primary and secondary RC stars are quite similar, we adopt this value derived from 
section |T 2 ] 

3.4 Performance assessment 

Before applying the improved distance estimation to the LAMOST data, we assess the performance of 
the improved M/f model demonstrated in section [33] 

We arbitrarily select 1000 points from the synthetic dataset and add random Gaussian errors to 
the true values of Tes, logp, and [Fe/H]. For each synthetic RC star, we create 20 mock stars with 
different random errors. In total, we create 20,000 mock stars with errors. Then, we derive the absolute 
magnitude for the mock data based on the method mentioned in Section 13.31 We totally create 9 sets 
of mock dataset with various measurement errors of logp, T^s, and [Fe/H]. In the first three mock 
datasets, we create the errors with uiogg = 0.1, 0.2, and 0.3 dex, respectively, crT-etf = 120K, and 
(T[Fe/H] = 0.1 dex. The second three mock datasets are created with random errors of = 120, 150, 
and 200 K, respectively, <J\ogg = 0.1 dex, and crpe/H] = 0.1 dex. The last three mock datasets have 
f^[Fe/H] = 0.1, 0.2, and 0.3 dex, respectively, with a-\ogg = 0.1 dex, and ax^tt = 120K. We compare the 
derived absolute magnitudes with the true values in the 9 mock samples. The residuals of the derived 
absolute magnitudes as functions of the errors of the stellar parameters are shown in Fig|9l 

The left panels of Fig|9]show the residuals of M^, denoted as bMif, as a function of the errors of 
logp at tJiogg = 0.1, 0.2, 0.3 dex from top to bottom, respectively. It shows that the larger the uncertainty 
of \ogg, the larger the errors in the absolute magnitude. When the random errors of \ogg are larger, 
the derived seems more overestimated. However, slightly increasing the uncertainties of [Fe/H] 
(the middle panels) and Teff (the right panels) would not significantly add the uncertainties of the Mk 
estimates. Fig[T0]shows the standared deviation of bMif, in terms of the a of the best fit Gaussian with 
the histogram of SMk, as functions of the uncertainties of \ogg (the left panel), [Fe/H] (the middle 
panel), and T^g (the right panel), respectively. Again, this figure shows that the accuracy of the 
estimates relies mostly on the accuracy of logp, rather than that of [Fe/H] and T^g. Therefore, the 
accurate \ogg calibrated with the asteroseismic log^ from Liu et al. (120151) is very important in the high 
accuracy distance estimation. 

Figure [TO] also shows that with the typical uncertainty of 0.1 dex in logp, the uncertainty of the 
derived absolute magnitude is better than 0.2 mag, corresponding to ^ 10 % in distance. 

4 DISCUSSIONS 

4.1 Comparison between two absolute magnitude models 

In section[3 we discuss two approaches to estimate the absolute magnitude for RC stars. It is worthy to 
directly compare the distance estimates based on the two different methods. Fig[TT]shows the difference 
of the distance estimates for the LAMOST RC stars between the fixed absolute magnitude-based and the 
isochrone-based method as a function of logp. It is seen that the fixed magnitude-based method tends 
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to underestimate the RC stars with smaller logg and significantly overestimate those with larger logg. 
The overestimation in large log^ data is because that the fixed absolute magnitude are dominated by 
the primary RC stars and hence may not be suitable for the secondary RC stars. The underestimation 
when logp <2.7dex is likely because the slightly inconsistency between the M.k used in the synthetic 
isochrones and the one derived in section 13.21 In Fig (TT] we find that the systematic bias can shift by 
more than 20% in large \ogg given a fixed absolute magnitude. When logp< 2.3 dex, the isochrone- 
based method may not give reliable estimates for RC stars since this is very close to the boundary 
of the isochrone data (see Fig[^. Therefore the errors increase in this region, as shown in FigfTTl 

4.2 External uncertainty of the distance estimation 

In Section 13.41 we use the mock data created from the synthetic data to test the performance of the 
distance estimation based on the isochrones. This, however, can only give the internal error but not 
the external one. We then cross-identify the RC stars from the LAMOST data with Hipparcos catalog. 
Unfortunately, we only obtain less than 10 common RC stars with parallax error less than 50%. Because 
most of these RC stars suffers larger uncertainty in parallax, they cannot be treated as the standard stars 
to investigate the external error of the isochrone-based distance estimates. To solve this issue, we need 
to wait the coming Gaia data (Bailer-Jones 120091) . which will release the first catalog in 2016. 

5 CONCLUSIONS 

In this work, we set up an empirical model in T^s vs. logp plane to identify the RC stars in the LAMOST 
DR2 data. The employed approach identifies not only the primary, but also the secondary RC stars. This 
will be very helpful for the study of the evolution of the Milky Way, because the range of the RC stars 
can be extended from <1 Gyr (contributed by the secondary RC stars) to lOGyr (contributed by the 
primary RC stars). Finally, we identify 118,711 RC stars from the LAMOST DR2 data with the 95% 
completeness level. The sample of the selected RC stars may be contaminated by the RGB stars with 
the fraction of about 20%. 

After the identification of the RC stars, we develop two different approaches to estimate their dis¬ 
tances as well as the interstellar extinction. The first one is based on the fixed absolute magnitude 
and intrinsic color index. The accuracy of the fixed absolute magnitude-based approach relies on the 
accuracy of the determination of the absolute magnitude and the dispersion of the absolute magnitude. 
Consequently, we revisit the absolute magnitude and its dispersion for RC stars in K band. We adopt the 
similar empirical model introduced by Paczynski & Stanek (119981) . but take into account the interstellar 
extinction for the Hipparcos RC stars. Although the extinction is small, it leads to the underestimation 
of the absolute magnitude by about 2%. 

Although this method is sufficiently accurate for most of the primary RC stars, it is not suitable for 
the secondary RC stars, which significantly vary in logp and hence also in the absolute magnitude. We 
then develop the second approach considering both types of RC stars based on the isochrones. With the 
empirical model, we associate the absolute magnitude of a RC star with its [Fe/H], logp, and T^s- The 
more delicate model can reduce the uncertainty of the distance estimates to 10% for almost all types of 
RC stars given the error of logp at around 0.1 dex. 

This RC sample well samples the Galactic disk, particularly in the outer disk, allowing us to map 
the structure, kinematics, and evolution of the Galactic disk in future works. 
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Table 1 The location of the RC candidates in the right panel in Fig [3 


[Fe/HJ 

Completeness level 

Contour level 

^RC 

I'otal ratio ot RC stars 

(-0.6,-0.3) 

68% 

509 

36099 

89.68% 


95% 

15.46 

54890 

80.47% 

(-0.3,0.0) 

58% 

42:25 

31553 

83.32% 


95% 

10.46 

48064 

76.29% 

(0.0,0.4) 

58% 

12.12 

11291 

84.87% 


95% 

3.01 

15757 

81.13% 


Notes: 

Contour level: the contour level in Fig|3]coiTesponding to 68% or 95% completeness. 

: number of RC candidates enclosed in the contour, (with the addition cut around the edges, see 
white lines in the right panels of Fig[3 

Ratio of RC stars: the total fraction of RC stars to the all metal-rich giant stars within the contour level. 

Table 2 The derived absolute magnitude and intrinsic color index of RC stars. 



Mk 

C^Mfc 

J-K 

O-J-K 


mag 

mag 

mag 

mag 

reddened 

— 1.529 ± U.UU3 

0.075 ± 0.003 

0.681 ± 0.003 

0.104 ±0.005 

dereddened 

- 1.549 ±0.003 

0.076 ± 0.003 

0.658 ± 0.001 

0.072 ±0.001 


Table 3 The coefficients at different [Fe/H] bins for the model of metal-rich RC stars 



Mk = 

Pilog‘‘g + P 2 \ogg ± P 3 

[Fe/HJ 

Jr’i 

J^2 

-^3 

ctmk 

(-0.3,-0.2) 

6.72 

-33.94 

41.15 

0.12 

(-0.2,-0.1) 

6.67 

-33.55 

40.49 

0.10 

(-0.1,0.0) 

6.21 

-31.25 

37.64 

0.11 

(0.0,0.1) 

6.17 

-31.00 

37.26 

0.10 

(0.1,0.2) 

6.19 

-31.05 

37.26 

0.09 


Table 4 The coefficients at different [Fe/H] bins for the Mk model of metal-poor RC stars 



logg < 2.45 

Mk = PiTeff + P2 

Mk = 

logg > 2.45 

Pilog^g ± P 2 logg ± P 3 

[Fe/H] 


J^2 

(TMk 

-^*1 

J^2 


umk 

(-0.6,-0.5) 

0.0024 

-13.65 

0.15 

7.87 

-39.98 

49.06 

0.09 

(-0.5,-0.4) 

0.0026 

-14.50 

0.14 

6.28 

-31.74 

38.41 

0.08 

(-0.4,-0.3) 

0.0027 

-14.73 

0.14 

6.54 

-33.09 

40.16 

0.08 
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Fig. 1 The observed distribution of the metal-rich ([Fe/H] > —0.6) giant branch stars from 
LAMOST DR2 in logg vs. T^e plane. The bin size is Alogp = 0.02 and ATes = 20K. 
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Fig. 2 Fitting the coefficients in Eq[T] The solid lines are the best fit curve, while the cross 
symbols are measured from RGB stars with \ogg< 2 and > 3. From left to right, the metal- 
licity bins are (—0.6, —0.3), (—0.3,0.0), and (0.0,0.4), respectively. The top panels show the 
best fit smoothing splines for N, the bottom panels show the best fit splines for cr^. The rest 
panels show the best fit polynomials, which coeffecients are appeared in Eq|2] 
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Fig. 3 From top to bottom, the metallicity bins are (—0.6, —0.3), (—0.3,0.0), and (0.0,0.4), 
respectively. The left panels present the distributions of the full sample of the giant stars with 
different metallicity, respectively. The middle panels present the best fit distribution models 
of RGB stars. The right panels present the distributions of the residuals of full sample in the 
left panels after subtracting the RGB distributions in the middle panels. They are mostly con¬ 
tributed by the RC stars. The red and yellow contours show the 68% and 95% completeness 
of the RC candidates, respectively. The horizontal and vertical white lines give cuts for stars 
with logp > 2.9 dex, logp < 2.1 dex and TeS > 5200K, respectively. The leaning white line 
located at logp= 0.0016Teff—4.7170 gives another cut for removing the bump stars located 
at the bottom-right corner at the 95% contour in the bottom-right panel. The color codes the 
stellar count in the bins. 
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Fig. 4 The colors show the fractions of true RC stars with different metallicity bins, 
(—0.6, —0.3), (—0.3,0.0), and (0.0, 0.4), from left to right panel, respectively, in the logp 
vs. Teff plane. 
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J-K or (J-K)o 


Fig. 5 The top-left panel shows the reddened J — K \s. Mk + Ak diagram for about 900 
giant stars selected from Hipparcos catalog. The middle-left panel shows the similar plot 
but with the dereddened color index and absolute magnitude. The bottom panel shows the 
marginalized distribution of the reddened J — K for the stars (corresponding to the top-left 
panel) with cross symbols and that of the dereddened color index (J — K)q (corresponding 
to the middle-left panel) with the square sybmols. The dashed and solid lines are the best fit 
model of Eq (|7]i for the J — K and (J — K)o, respectively. The top-right panel shows the 
marginalized distribution of M-k+Ak with cross symbols. The dashed line stands for the best 
fit model according to Eq. (|6]l. The middle-right panel shows the marginalized distribution of 
the dereddened Mk with square symbols, and the best fit model with the solid line. 
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Fig. 6 Panel (a): The synthetic logg vs. Mk diagram for RC stars from PARSEC stellar 
evolution track(Bressan et al. 120121 1. The range of Age is from 4Myr to 13Gyr with steps of 
A(logf) = 0.05. The ranges of Metallicity [Fe/H] and Mini are —0.6 ~ 0.3 (Zq=0.0152) 
and 0.8 ^ 2Mq respectively. Panel (b)-(e): The Xogg-M-K relation for RC stars with Mini 
between 0.8 ~ I.IMq, 1.1 ^ 1.4Mq, 1.4 ^ 1.7Mq, and 1.7 ^ 2.0Mq, respectively. 
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Fig. 7 For RC stars with logf; > 2.45, the Mk is modeled as quadratic polynomials of logp in 
each [Fe/H] bin. The red dots are the synthetic data and the blue lines are the best fit quadratic 
polynomials. 
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^eff 


Fig. 8 For RC stars with logg < 2.45 and [Fe/H]< —0.3, the M/f is modeled linearly as a 
function of Teff in each [Fe/H] bin. The red dots are the synthetic data and the blue lines are 
the best ht lines. 
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(51og5 


5[Fe/E\ 


Fig. 9 The scatter plot of the residuals of derived Mjc, denoted by SMk, for 20,000 simulate 
data with various uncertainties in \ogg (left panels), [Fe/H] (middle panels), and Tgg (right 
panels). 
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Fig. 10 The relationship between the standard deviation of the residual Mk, denoted by auK > 
and the uncertainty of log 5 (triogg), [Fe/H] (crpe/H]). and Tes 



logff 


Fig. 11 The comparison between the the isochrone-based (Diso) and fixed absolute 
magnitude-based (DaxM^) distance estimates at various logp. 










